智能论文笔记

Flexible Table Recognition and Semantic Interpretation System

Marcin Namysl , Alexander M. Esser , Sven Behnke , Joachim Köhler

分类：计算机视觉

2021-05-25

表提取是一个重要但仍未解决的问题。在本文中，我们介绍了一种柔性和模块化的台式提取系统。我们开发了两个基于规则的算法，执行完整的表识别过程，包括表检测和分割，并支持最常见的表格格式。此外，为了纳入语义信息的提取，我们开发了一种基于图形的表解释方法。我们对挑战表识别基准ICDAR 2013和ICDAR 2019进行了广泛的实验，实现了与最先进的方法竞争的结果。我们完整的信息提取系统展出了0.7380的高F1得分。为了支持未来的信息提取研究，我们将来自我们的表解释实验，使资源（地面诠释，评估脚本，算法参数）公开可用。

translated by 谷歌翻译

Deep Learning Models for River Classification at Sub-Meter Resolutions from Multispectral and Panchromatic Commercial Satellite Imagery

Joachim Moortgat , Ziwei Li , Michael Durand , Ian Howat , Bidhyananda Yadav , Chunli Dai

分类：计算机视觉 | 机器学习

2022-12-27

Remote sensing of the Earth's surface water is critical in a wide range of environmental studies, from evaluating the societal impacts of seasonal droughts and floods to the large-scale implications of climate change. Consequently, a large literature exists on the classification of water from satellite imagery. Yet, previous methods have been limited by 1) the spatial resolution of public satellite imagery, 2) classification schemes that operate at the pixel level, and 3) the need for multiple spectral bands. We advance the state-of-the-art by 1) using commercial imagery with panchromatic and multispectral resolutions of 30 cm and 1.2 m, respectively, 2) developing multiple fully convolutional neural networks (FCN) that can learn the morphological features of water bodies in addition to their spectral properties, and 3) FCN that can classify water even from panchromatic imagery. This study focuses on rivers in the Arctic, using images from the Quickbird, WorldView, and GeoEye satellites. Because no training data are available at such high resolutions, we construct those manually. First, we use the RGB, and NIR bands of the 8-band multispectral sensors. Those trained models all achieve excellent precision and recall over 90% on validation data, aided by on-the-fly preprocessing of the training data specific to satellite imagery. In a novel approach, we then use results from the multispectral model to generate training data for FCN that only require panchromatic imagery, of which considerably more is available. Despite the smaller feature space, these models still achieve a precision and recall of over 85%. We provide our open-source codes and trained model parameters to the remote sensing community, which paves the way to a wide range of environmental hydrology applications at vastly superior accuracies and 2 orders of magnitude higher spatial resolution than previously possible.

translated by 谷歌翻译

Is it worth it? An experimental comparison of six deep- and classical machine learning methods for unsupervised anomaly detection in time series

Ferdinand Rewicki , Joachim Denzler , Julia Niebling

分类：机器学习 | 人工智能

2022-12-21

The detection of anomalies in time series data is crucial in a wide range of applications, such as system monitoring, health care or cyber security. While the vast number of available methods makes selecting the right method for a certain application hard enough, different methods have different strengths, e.g. regarding the type of anomalies they are able to find. In this work, we compare six unsupervised anomaly detection methods with different complexities to answer the questions: Are the more complex methods usually performing better? And are there specific anomaly types that those method are tailored to? The comparison is done on the UCR anomaly archive, a recent benchmark dataset for anomaly detection. We compare the six methods by analyzing the experimental results on a dataset- and anomaly type level after tuning the necessary hyperparameter for each method. Additionally we examine the ability of individual methods to incorporate prior knowledge about the anomalies and analyse the differences of point-wise and sequence wise features. We show with broad experiments, that the classical machine learning methods show a superior performance compared to the deep learning methods across a wide range of anomaly types.

translated by 谷歌翻译

Découvrir de nouvelles classes dans des données tabulaires

Colin Troisemaine , Joachim Flocon-Cholet , Stéphane Gosselin , Sandrine Vaton , Alexandre Reiffers-Masson , Vincent Lemaire

分类：机器学习

2022-11-28

In Novel Class Discovery (NCD), the goal is to find new classes in an unlabeled set given a labeled set of known but different classes. While NCD has recently gained attention from the community, no framework has yet been proposed for heterogeneous tabular data, despite being a very common representation of data. In this paper, we propose TabularNCD, a new method for discovering novel classes in tabular data. We show a way to extract knowledge from already known classes to guide the discovery process of novel classes in the context of tabular data which contains heterogeneous variables. A part of this process is done by a new method for defining pseudo labels, and we follow recent findings in Multi-Task Learning to optimize a joint objective function. Our method demonstrates that NCD is not only applicable to images but also to heterogeneous tabular data.

translated by 谷歌翻译

Sensor Visibility Estimation: Metrics and Methods for Systematic Performance Evaluation and Improvement

Joachim Börger , Marc Patrick Zapf , Marat Kopytjuk , Xinrun Li 2 , Claudius Gläser

分类：计算机视觉 | 机器人

2022-11-11

Sensor visibility is crucial for safety-critical applications in automotive, robotics, smart infrastructure and others: In addition to object detection and occupancy mapping, visibility describes where a sensor can potentially measure or is blind. This knowledge can enhance functional safety and perception algorithms or optimize sensor topologies. Despite its significance, to the best of our knowledge, neither a common definition of visibility nor performance metrics exist yet. We close this gap and provide a definition of visibility, derived from a use case review. We introduce metrics and a framework to assess the performance of visibility estimators. Our metrics are verified with labeled real-world and simulation data from infrastructure radars and cameras: The framework easily identifies false visible or false invisible estimations which are safety-critical. Applying our metrics, we enhance the radar and camera visibility estimators by modeling the 3D elevation of sensor and objects. This refinement outperforms the conventional planar 2D approach in trustfulness and thus safety.

translated by 谷歌翻译

Rmagine: 3D Range Sensor Simulation in Polygonal Maps via Raytracing for Embedded Hardware on Mobile Robots

Alexander Mock , Thomas Wiemann , Joachim Hertzberg

分类：机器人

2022-09-27

传感器仿真已成为一种有前途且强大的技术，可以找到许多现实世界机器人任务（例如本地化和姿势跟踪）的解决方案。但是，常用的模拟器具有高硬件要求，因此主要用于高端计算机。在本文中，我们提出了一种方法，可以直接在使用三角形网格作为环境图的移动机器人的嵌入式硬件上模拟范围传感器。这个名为Rmagine的库允许机器人直接通过射线缩放模拟传感器数据为任意范围传感器。由于机器人通常只有有限的计算资源，因此Rmagine的目的是灵活且轻巧，同时甚至可以很好地扩展到大型环境图。它通过将统一的API放在硬件制造商提供的特定专有库上，将统一的API放置在诸如Nvidia Jetson之类的多个平台上，例如Nvidia Jetson。这项工作旨在根据范围数据的模拟来支持机器人应用程序的未来开发，这些数据以前在移动系统上的合理时间内无法计算。

translated by 谷歌翻译

Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs

Đorđe Miladinović , Kumar Shridhar , Kushal Jain , Max B. Paulus , Joachim M. Buhmann , Carl Allen

分类：机器学习

2022-09-26

原则上，将变异自动编码器（VAE）应用于顺序数据提供了一种用于控制序列生成，操纵和结构化表示学习的方法。但是，训练序列VAE具有挑战性：自回归解码器通常可以解释数据而无需使用潜在空间，即后置倒塌。为了减轻这种情况，最新的模型通过将均匀的随机辍学量应用于解码器输入来削弱强大的解码器。从理论上讲，我们表明，这可以消除解码器输入提供的点式互信息，该信息通过利用潜在空间来补偿。然后，我们提出了一种对抗性训练策略，以实现基于信息的随机辍学。与标准文本基准数据集上的均匀辍学相比，我们的目标方法同时提高了序列建模性能和潜在空间中捕获的信息。

translated by 谷歌翻译

Sequential Causal Effect Variational Autoencoder: Time Series Causal Link Estimation under Hidden Confounding

Violeta Teodora Trifunov , Maha Shadaydeh , Joachim Denzler

分类：机器学习

2022-09-23

在存在潜在变量的情况下，从观察数据中估算因果关系的效果有时会导致虚假关系，这可能被错误地认为是因果关系。这是许多领域的重要问题，例如金融和气候科学。我们提出了序性因果效应变异自动编码器（SCEVAE），这是一种在隐藏混杂下的时间序列因果关系分析的新方法。它基于CEVAE框架和复发性神经网络。通过基于Pearl的Do-Calculus使用直接因果标准来计算因果链接的混杂变量强度。我们通过将其应用于具有线性和非线性因果链接的合成数据集，以显示SCEVAE的功效。此外，我们将方法应用于真实的气溶胶气候观察数据。我们将我们的方法与在合成数据上有或没有替代混杂因素的时间序列变形方法进行比较。我们证明我们的方法通过将两种方法与地面真理进行比较来表现更好。对于真实数据，我们使用因果链接的专家知识，并显示正确的代理变量的使用如何帮助数据重建。

translated by 谷歌翻译

Jeopardy: An Invertible Functional Programming Language

Joachim Tilsted Kristensen , Robin Kaarsgaard , Michael Kirkedal Thomsen

分类：自然语言处理

2022-09-06

一种算法描述了将问题转化为解决方案的一系列步骤。此外，当倒置序列定义明确时，我们说算法是可逆的。虽然可以用通用语言描述可逆算法，但通常无法通过此类语言对可逆性进行保证，因此确保可逆性需要额外（通常是非平凡）的证据。另一方面，尽管可逆编程语言可以通过将允许的操作限制为本地可逆的操作来确保其程序可逆，但以可逆风格编写程序可能会很麻烦，即使实现的算法是，也可能与常规实现有很大差异，实际上，可逆。在本文中，我们介绍了一种功能性编程语言Jeopardy，可以保证程序的可逆性而不会施加本地可逆性。特别是，危险允许有限使用不可避免的 - 甚至不确定性！ - 操作，只要它们以静态确定为可逆的方式使用。但是，保证可逆性并不明显。因此，我们概述了可以提供部分静态保证的三种方法。

translated by 谷歌翻译

A Method for Discovering Novel Classes in Tabular Data

Colin Troisemaine , Joachim Flocon-Cholet , Stéphane Gosselin , Sandrine Vaton , Alexandre Reiffers-Masson , Vincent Lemaire

分类：机器学习

2022-09-02

在新颖的类发现（NCD）中，目标是在一个未标记的集合中找到新的类，并给定一组已知但不同的类别。尽管NCD最近引起了社区的关注，但尽管非常普遍的数据表示，但尚未提出异质表格数据的框架。在本文中，我们提出了TabularNCD，这是一种在表格数据中发现新类别的新方法。我们展示了一种从已知类别中提取知识的方法，以指导包含异质变量的表格数据中新型类的发现过程。该过程的一部分是通过定义伪标签的新方法来完成的，我们遵循多任务学习中的最新发现以优化关节目标函数。我们的方法表明，NCD不仅适用于图像，而且适用于异质表格数据。进行了广泛的实验，以评估我们的方法并证明其对7种不同公共分类数据集的3个竞争对手的有效性。

translated by 谷歌翻译